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Some Definitions ■} 
yThb^XJt of the term evaluate suggests that.the function of evalua- 
tion lyj to place a value on the thing being evaluated. But there are 
two senses in which an entity's value may be conceptualized. ,On the 
one hand, the object may have value of its own, implicit, inherent, 
independfS^ of any requirements of applicability or use. So for ex- 
ample, "pure" science, as the adjective gmre suggest^, has value un- 
tainted by any considerations outside the sphere of science Itself. It is 
enough that scientists appreciate (esteem, admire) the pure discovery 
or development for' its own sake. We shall apply the term merit to this 
kind of intrinsic^, context-free value. 

On the other hand, the entity may also have value within some 
context of use or application; thus, "applied" science is undertaken for 
the sake of solving some practical problem. Th« product^ of applied 
science are valued to the extenjt to which they provide solutions, i.e., 
have utility in a practical context. While some scientists may denigrate 
these products as having "no theoretical significant" (i.e., as not being 
meritorious), they may nevertheless have great value in an engineering 
or indutrial setting. We shall apply the term worth to this kind of 
extrlnsk: or context determined value. ^ ^ 

'Some additional examples will help to make the distinction we are 
drawing clear. Consider fout quite different entities that might be - 
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Scriven (197^) has made a similar distinction except that he uses 
the terms merit and value in the same sense that we use merit and wor- 
th. We believe the Tftter formulation to be superior because it avoids 
confusior) between the terms value and evaluate . Valve seems to be a 
more geperic term in this context; the placing of value is waht results 
from the evaluation process. We agree with Scriven, obviously, that 
there are two^ ways to value; we prefer to avoid the redundancy and 
confusion engenerdered when one of the 'subtypes Is called by the same 
name as the more generic type. 



evaluated: a professor, gold, a mathematical proof, and a language 
arts curriculum. A professor might be judged for merit on his or her 

'X7 — 

scholarliness, i.e. , his or her standing in the academic community of 
peers and his or . her relative contribution to the discipline. Obviously 
merit so determined would accompany the professor wherever he or she 
happened tp .be employed; scholarliness is an intrinsic characteristic of 
the person. But her or she mfgfvTbe judged for wpxth on such factors, 
as: that, as a woman, she provides a good role model; that she rates 
high on entreprenuership , bringing in many thousands of . dollars an- 
nually In outside grants; or that she teaches in a high-demand field, 
that is, one in which student enrollments are burgeoning. Worth de- 
pends on what is thought to be important in the local context; a college 
already having many women, on its faculty, that -cared , little abqut at- 
tracting outside funds, and that maintained an upper limit on its en- 
rollments would not find this particular professor especially worthy.^ 

Gold might be judged for merit on its inherent beauty--it is at- 
tractive, and also has properties that permit it to be easily fashioned 
into objects of beauty, such as jewelry, plate, or statuary. It is 
judged for worth, however, i^n the trading marts of London, Zurich, 
and Frankfocl, and daily fluctuations in it worth ar^ worthy subjects 
for newspaper accounts ^orldwidei.' As you are well atvare, the current 
worth of cold is well in excess of $300 an ounce. 

A methematical proof might be judged for mer it on its parsiraonious- 
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A little though reveals that" with respect to the evaluation of a 
professor, it is primarily merit which is at stake In a promotion decision 
and worth in a tenure decision'. Most^ universities contl^iue to confuse 
these two concepts and make tenure and promotion decisions virtually 
equivalent In terms both of criteria and decision processes. See Scri- 
ven (1978) for an' excellent discussiqn of this problem. 
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ness and Its elegance. Given two proofs of the same proposition, the 
shorter Is preferred, and a direct proof is considered more elegant than 
an indirect one. Boolean algebra, when first proposed, was greatly 
admired, not for its utility (that came later) but for its inherent logic 
and coherence; it was^ in a word, an. elegant formulation. That same 
proof, however, is ^jcJged for worth on its applicability in a practical 
context. The mathematics of cohiplex (imaginary) numbers, for exam- 
ple, was considered a mere mental toy until it proved useful in, solving 
the- equations of alternating electrical circuits which had, until that 
time, withstood conventional analysis. 

A language arts curriculum might be judged for merit on its sim- 
plicity, e.g., ills straightforward form, targeted scope, lack, of convo- 

* ... 
luted writing, and its' degree of integration, \hat is, the extent to 

which all of its slibstantive and fol^mal components are properly artic- 
ulated or "fit." But it is likely to be judged for worth on such criteria 

/ 

as the extent to which it produces stuflent learning, it appropriate to 
the ability level of \he students with whom it will be used, is free from 
bias (especially bj^s toward ^ the sex, ethnicity, national origins, re- 
ligion , or culture of the students exposed to it), and is teachable b^ 
the average teacher in in adopting system. 

■ \ ) 

Merit and Worth as Plural istic Phenomen a 
v - 
Since both merit and worth are statements about different aspects 

of values, it is df interest to inquire whether their estimates, once 

determined, are fixed, or whether they can be altered as the result of 

any particular circumstances. Does the value of an object, in either 

the sense oi merit or worth, remain constant, or can either merit or 

worth, or^both, shift from time to time or place to^lace? 
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In one sense, both merit and worth are variable. While merit Is 
an estimate of In^lnsic value, and would^ therefore seem to be an immut- 
able property of the entity whose merit Is being assessed, it is clear 
that persons competent to judge merit, for example, academic peers 
judging scholarliness or mathematicians judging the elegance of a proof, 

may differ among themselves both In stating Indicators of merit and In 

., ... . ". _. 

assessing the merit of any particular entity In terms of those indl- 

cators. Just how doe^ one tell whether scholarliness characterizes any 

professor, and how does one assess Professor X on the possession, of 

those characteristics? Different responses to these questions will lead 

to substantially different judgments of merit. 
*^ 

Similarly, one may ask how one determthes Indicators of worth and 
<• 

assesses a. particular entity in their terms. What does it mean to be a 
good role model, in the case of a professor, and how can one tell 
whether a curriculum Is appropriate to the learning ability of the young- 
sters exposed to It? How does one tell whether this particular language 
arts curriculum possesses whatever characteristics one believes are 
found in good currlcular? Again, judgments about both Indicators and 
the degree to which Curriculum X conforms to those iTidkators are 

■t 

likely to vary depending upon who makes the judgfnents. 

From this - analysis one might conclude that merit and worth are 
equally variable. But there is a sense In which judgmeVlts of worth are 
a great deal more / variable than are judgments of nierit. That con- 
tingency arises from the fact that judgments^of merit are tied to Intrinsic 
c haract eristics of the entity Itself, but jud^fments of worth depend upon 
the ih teractlon of the entity with some context. A professor, whatever 
his or\er merit, has different worth depending on whether that worth 
is being jvidged In the context of a university 




seeking a scholar, one seeking an entrepreneur, or one seeking some- 
one who can teach several sections of beginning business/ law. The 
worth of gold, whatever its beauty, depends on what speculators '^e 
willing to pay for it on any given day. the worth of a mathematical 
proof, whatever its elegance, depends on the existence of a real-world 
problem (which might be another -mathematical problem) which it can 
help solve; Boolean qlgebra was relatively worthless, although extremely 
meritorious, until its relationship -to verbal logic was exploited. And a 
language arts curriculum, however simple and wellintegrated , is rel- 
atively worthies with a group of blatk ghetto youngsters if it was 
designed for use with an upper middle class white clientele. 

What is impor^nt to note in all these instances is that whUe merit 
remains more or less constant, at least in the sense that it is not 
unreasonable' to expect that consensus about any object's merit can be 
reached, worth can and does change dramatically: change the 
^PPJ^^^^i and,_y.oy„ c^^^ There are several immediate and 

crucial consiquences of this lact: 

I . The determination of worth requires an ad hoc evaluation at 
every site in which the entiry being evaluated is contemplated for use. 
It is impossible to warrant an entity for use generally, once and for 
all. To be sure, it is possible to conduct an' evaluation to determine 
the worth of the (intity in a number of different situations, and then to 
describe the entity's worth in each of those different contexts. But In 
that case, before worth in Context X could be inferred, one would need 
to have a great deal of descriptive, information (often called- "thick 
description") about the settings in which the Valuations had taken 
j^lace, and all-out Context X, so that the degree of fit between X and 

( ■ ■ 



*any ev«tluated context could be determined. Thus, local, specific 
evaluations to determine worth cannot be avoided, 

« 

2. One very important characteristic of local sett^nge* is the 
values Held by the se\»eral aadiences--stakeholders-,-to whom an eval- 
uation may be addressed. It has become patently clear over the past 
eiecade and a half that the United States js not a value consensual 

, society but a pluralistic one. Hamilton (19?7) has stressed the fact, 
that so-called responsive models of evaluation have appeared in recent 
years in partial response to the recognition of this country's pluralistic 
value structure. Worth must be assessed, at least ih part, in relation 
to thot;e differing values. It must of course also*^be assessed in rela- 
tion to other contextual factors, e.g., the. nature of the student body, 
the qualifications of the teachers, the extent of available resources, and 
the liice. ^ 

3. The fact that pleualistic values e>dst leads to tl;ie inescapable 
con(!lusion that a particular entity may be judged worthy by one group 
of staiceholds but worthless^ by another group holding conflicting val- 
ues. In general, there can- be no consensual determination of the worth 
of an entity. And just as there can be different assessments within a 
particular setting because of different values held by important local 
reference groups,, even more so is it likely' that different assessments 
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This fact has enormous implication's for innovation dissemination 
strategies based on the assumption that innovations can be engineered, 
tested, warranted for general use, and disseminated to potential adop- 
ters. Each adoption situation must be assessed to determine whether 
the innovation fits the situation; the mere fact that it has fit in some 
situations is no guarantee that it will fit in this situation. 

\ t 
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will be reached between sites because of different values held in^ch. 
Thus, while there may be differences among st;akeho!der8 In, say, 
Peoria, Illinois, those difference are likely to be smaller than the diff- 
erences between Peoria ^nd Birmingham, Alabama, because these two 
exist in rath*er different cultural settings. It should be noted that 
these different judgments of worth, even confliGting judgements of 
worth, may all exist despite the level of mferit of the entity; meritori- 
ousness does not lead Automatically to judgments of worthiness. 

4. MethQdologically , the most important consequence of the pre- 
ceding deductions . is that evaluations of worth must be grounded in 
^IM^^^studjes o£Jo^ While an exp'erienced evaluator may be 

able to generate a priori hypotheses about the value (and 'other con- 
textual factor) positions in a given setting, in most cases the evaluator 
wi>ll want, at the very least, to check his or her hypotheses, if not 
generate them de novo, by close study of and interaction with the 
context itself, especially with the people who inhabit it. And field 
studies call, in general, for naturalistic approaches supported by quali- 
tative methods, as in anthropology, rather than for scientistic ap- 
proaches supported by quantitative methods, as in experimental psy- 
chology. 

TJ^AJDet(y;minaUq^^ . 
Merit, an intrinsic property of the entity being evaluated, is 
determir^d in one of two ways: by assessing the degree to which the 



The concept of grounding has emerged as crucial to in^^iiries 
generally. See Glaser and Strauss (1967) for a thorough explication of 
this term. . • 
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entity conforms to certain standards upon which a relevant professional 

group or group of experts agree, which - might bJ called 'a bsolute merit 

evaluation, or by comparing the entity to other entities within the same 

♦ 

class, \vhich might be called comparative merit evaluation. So for ex- 
ample, in determining the merit ^of a professor, we. might canvass a 
, group of professors about what they take to be n\eritorious character- 
istics; whatever emerged, e.g., scholarViess , woiild then bemused as a 
bas'is for testing the performance .of actiial professors. Or, in the case 
of^ a language arts curriculum, we might canvass a *group of language 
arts educators, writers, editors, and so on, to determine what the 
characteristic's of a meritorious curriculum are. A proposed curriculum 
could then be tested against the named criteria. Both these instances 
are examples of absolute mtfrit evaluation. But we might in many prac- 
tical situations be more interested in comparative judgments. Gjven 
t^at there are three profes"sors to be considered fc^ merit increments 
this year, but that there is enough money only for one increment, we 
may not care what each professor's absolute standifig is on scholarliness 
but might rather wish to focus on their relative scholarliness in order 
to be able to decide among them. Or, in the case of curricula, we may ^ 
not care what the absolute degree of integration of a curriculum is but 
only whether it is more or less integrated than another curriculum 
currently in use. 

Worth, an extrinsic property of the entity beingT evalua^, is 
determined by comparing the entity's impact or outcomes relative to 
some set of external requirements, e.g., the results of a needs assess- 
ment oKa context evaluation. What is of interest is an assessment of 
the entity's benefits with respect to 9 set of criteria. These criteria 
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are not drawn from a professional group or a group of experts but from 
the variety of local stakeholdlng groups that are related to or affected 
by the entity being evaluated. So, while merit criteria may be rela^ 
tlvely stable, criteria of worth are highly variable depending upon 
which stakeholdlng group is being assessed, and in which context. 



Thus, the minority faculty in an Institution may insist that newly re- 

.... . ' _ . ... .. _ _ 



criTlted faculty must- provide good minority role models, while the Vice- " 
President for Research and the Vice-President for Administration may 
be much more concerned about their ability to attract outside grants. 
The NAACP in a local community may be mdre concerned about whether 
a proposed curriculum is free from cultural bias than are the teachers, 
who may instead focus upon the curriculum's appropriateness for the 
learning levels of their students and on their owi^ competence to teach 
using the new materials and techniques. 

^ There is of course no guarantee that any statement of criteria of 
worth that might be made by a stakehoding group represents its actual • 

operating position. Scriven (1978) in a discussion of merit vs. value 

5 * 
(worth), as these relate to- faculty evaluation ^ in a uni.versity, r^otes 

■ ' ■■ 

that four different kinds of value (wortJ|) criteria emerge: 

o Alleged or r hetorical value s: t^e value^; system to which an 
institution publicly subscribes . ' So foij* I: example , "Teaching 
is very important at" Berkeley---we never take personnel act- 
ion without evidence about teaching." (p/ 23) 

o actual or truer values : the values that may be deduced from 
the Institution's actual practices. "Solid evidence of first-rate . 
resewrch if required, only weak evidence of minimal teaching 

per- ^ . , 

formfance is required."' (p. 23) "~ 



\ 
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See Footnote V, above 
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o inteiests of the IhsUtution: the "set of factors the promotion 
tTon "of wKicK" waulT actually be valuable for--beneficial to the 
interests or welfare of--the instltuYion (a really massive com- 
mitment to improving undergraduate education to the , poiht of 
hiring (promoting, tenuring) for tejaching talent, with research 
ignored, the use of a ^?eVious process of evaluation for. teach- 
. ing, etc.-)." (p. 23) i . ■ ' 

■' .' ' 

^ y^lii«s : the normative value system fhat the institution 

sHouIof Tiave. "Th«re is the possibility that the morally, legal- 
ly, socially and/or educationally correct set of (personnel) 
values for Berkeley is the research set." (p. 23) 

•In general,, we would argue, the evaluator should ^void being 

taken in by . aU^ged ^^)r rhetc^rical v<iilues, study pj;xK:tices to determine 

the actual or true values, determine-irtsofar.^ as'lie or she canVhat Is-in 

the interest .of the institutii)*^ or stakeholding #fuop --that holds* th^^^^ 

■ - ■' * / .. . • 

values .and convey that information- to them, and finally, de'gr^ne 

whenever possible a normative set of values for the situation ahd'^coTn- 
pare the actual or true values to it. The latter step may not be pos- 
sible in many situations because reference to "ideal" values ynust-at 
once raise the question, "Ideal for whom, or in what sense?" An- 
swering, this question may. be well beyond the scope of the typical 
evaluations. - . . ' ; • - ; 

In practice at least three different ways may be .found in which 

merit and worth are lirtked: 
' ■. ^ ■ 

M^iLlDA-.^rlll_]iiken Jo.!!^,^^ that is,' an 

institution^or stakeholder group elects to define thct- two a .equivalent, ' 
so that no ^nefits are taken into account . Scriven, (1^78) apparently " 
beleives this t& be th^ case at Be nice ley for personnel decision^; thus 
both promotion and"1:enure at that institution seem to be dependent 
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-upon assessments of research and scholarly productivity. Many instl- 

tuitions make this definition by default wheh, In their faculty hand- 

books, they treat promotion and tenure (iecisiOns as rfqulVal^t and pose 

precisely the game criteria (usually teaching, research, and service) for, 

both. Merit, and worth are <)ften defined as equivalent by default in 

dissemination programs for Federally funded innovations, for^ seems to 

be assumed \hat a meritorious innovation can be warrailted for general 

use in the schools* (recall for example t^e "proven products ^list" of the 

Office of Education or the more recently established Joint Dissemination 

. ■* ■ ■ . z^- 

Review Panel's etffort to certify Innovations that have been evahaated 

6 * 
positively). Jn either case, it is apparei^ that the definition of worth 

as equivalent td merit is probably an eif*rpr; the distinction is worths 
preserving . 

^.^?^_.-^^^lerit arid^ , 
In a tenure case, for example, it might be argued that benefits derived 
from the presence of a certain faculty member (as a minority group role', 
model-, say) are so powerful that it really does not matter what his or 
her merit is. In the most difficult days of affirmative action there 
seems to be little dOubt that many universities took just this ptfeture. 
In the case of a currcular innovation, a local school system, under 
gr^at public pressure for accountability and anxious to do something to 
demonstrate its commitment to change, might well jump on the band- 
wagon regardless of the merit of the innovation. Obviously the long- 
run negative effects of such posutres are serious; no institution or 
reference group can long take such a posture and .survive. 



It should be noted that the National Diffusion Network, also 
Federally supported, takes the posture that local school systems should 
be encouraged to look at JDRP approved innovations that appear to be 
responsive to some locally identified need and then to rfia'ke whatever 
adaptations are necessary, 

\ 
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It is ^8o possible to define rnerit as Independent of worth. Uni- 
versity faculty member^ may stretch to defin« a position for which staff ' 
recruitment is to occur in such a way that institutional benefits are 
•overlooked for' the sake of getting a "big-name scholar." Similarly the 
worth of an innovation in a local setting may be downgraded, virtually 
to zero, simple because the proposed innovative curriculum has "gotten 
such* good grades in natfonal evaluations." 

As Scriven (1978) suggests, merit standards drive out standards' 
of worth in some situations, while in other cases standards of worth 
drive out merit standards. Neither situation is desirable. 

^a£^_3j^ Worthy 

PAyAoV'^eL beneff^ Without the minimum level of merit, worth is auto- 
matically zero. Thus, a professor who was not minimally meritorious 
(as demonstrated, say, by the fact that he or she was continuously 
promoted) would not be considered for tenure regardless of what other 
benefits might accrue to the ^stitution by virtue of his or her pres- 
ence. But minimal merit would not be enough; well-defined benefits 
would also have to be demonstrated to lead to a positive tenure^. de- 
^cision. In the cast of a curriculum, local adoption would not occur for 
a porgram that was not minimally meritorious, but adoption would also 
not occur unless other benefits could be demonstrated as well. 

It- is also the case that decisions about merit are usually not made 
in the absence of simultanisous considerations of worth. Universities do 
not usually hire a professor simply on merit; some possibility that this 
professor could also serve some university purpose muust also exist. 
And no one would undertake to develop a curriculum no matter what its 
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promise on merit i^tiless he or she had reason to believe that the currlc- 
ulum would also be useful in some setting. 

The methodological problem posed foi;. the evaluator . in this more 
•typical real, world case (Case 3) is to avoid becoming confused about 
the separate components, that go into "balanced" judgments simply 
because th^y are interrielated . While merit decisions reinforce decisione 
of worth, and vice-versa, they are seprate decisions, made on sep- 
arate criteria, and require different methodological approaches to be 
established. 

To For mulate and Summatlve Evaluation 
The relationship of merit and worth to the concepts of formative 
and summative evaluation (Scriven, 1967) is complex. It is easy to be 
mtsled intd believing that merit evaluations are formative and worth 
evaluations are summative. The aim of formative evaluation is, after 
all, refinement and Improvement, which makes one think of intrinsic 
aspects of entities that can be refined and improved. The aim of 
summative evaluation is to determine impact or outcomes, which makes 
one think of contexts in which such impacts or outcomes may be noted. 
But in fact the dim^ensions of merit/worth and formatlve/summative are 
orthogonal; evaluations of merit can be either formative or summative 
just as can evaluations of worth. 

This assertion can perhaps be best understood through an ex- 
ample. We turn by way of Illustration to the case of an innovative 
program or curriculum jvhlch is to be developed by a national team and 
theV\ to be considered for adoption in a local setting- -a quite common 
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situatlbn. Evaluations th»t seek to establish the merit of such an 
innovative progrann j or curriculum might properly be ' termeji devel- 
opmental evaluations- -evaluations occurring during the process /of de- 
velopment that seek to improve or refine the ehtity to its opthnal state 
as an ijistance of its kind. Evaluations thaV'seek to establish the worth ' 
of such an entity may be thought of as adop tive eval- 
uations- -evaluations occurring after a completed entify summatively 
evaluated" for merit is. availably that relate to the use of the entity 
lO- AOIT'*?. conc^ Both developmental and adoptive evaluations 

may be carried out for either summative or formative purposes; four 
types are determined by crossing the merit/worth and formative/ summa- 
tive dimensions. We identify and analyze these four types below in 
terms of their purposes, their audi^ances, and the sources from 
^.bJ^il iud^|T|^ei^^^ 

1 - The purpose of fonpati^ de^ is to mod- 

ify and improve the design of an innovative .pro'^i::am or curriculum 
while it is still under development. The audience for such an eval- 
uation is the national developmental team, since they are the agents who 
can act on th| evaluation information to make whatever changes are 
indicated. The sources of standards are panels of professional peers or 
other expert groups, who have reason to be well acquainted with the 
characteristics of meritorious innovations of that type, e.g., curriculum 
experts, substantive experts, substantive experts, other evaluato'rs, 
and the like. While such groups no doubt will not agree on every 
detail they are most competent to specify what shall be taken into 
account in determining merit. 
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2. The purpose of summa tlve de velopmental evaluatio ns Is to criti- 
que a completed entity in terms of professional or expert standards so 
as to be able to certify and warrant merit. The primary audience for 
this evaluation the group ofpotential" adopters who need to be re- 
assured about merit before they car\ reasonably consider the worth of 
the innovative program or curriculum in their own contexts.' The 
?.9.y.rce ?L.8tandar^^ continues to be panels ^ professional experts. 

3. The purpose of formative adoptive evaluatlpns is fit or adapt 
the program or curriculum to a local context or situation. As we have 
noted, there will always be local contextual, including value, differ- 
ences, and no innovation could hope to fit all^ontexts. What is re- 
quired is a local assessment of context and values so that the fit of the 
innovation can be determined. Refinements, adjustments, and other 

♦ 

adaptations will be called for to optimize the fit; a local adaptation team 
is the proper auciience for formative adoptive evaluation informations, 
and the local assessment of context and values (which might be con- 
strued as a needs assessment) becomes the sour ce for standards. 

4. The purpose of summat^ is to certify or 
warrant the adapted program or curriculum for permanent local use. 

audience for this evaluation is that group of local decision^ makers 
(probably some from each of the stalCeholding audiences) which will 
make or shape the final decision about -whether to permanently adopt 
the innovation. This group will vary widely in composition ahd clarity, 
from place to place depending upon how decisions happen to be made in 
each locale. The s ource of st andards for this evaluation is a local 
needs assessment, for unless the innovative program or curriculum 
meets local needs there can be no justification for adopting it. 

This example focuses on the adoption of an innovative program or 
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curriculum, but the distinctions which nre illustfated by it apply when- 

* .. " ■ « 

ever merit and worth are at issue. In the case of the University 
pr^essor, as a further example, one can imagine a .formative merit 
evaluation in which the professor is himself or herself Che audience (the 
professor is in the final analysis responsible for his or her own be- 
havior), the 'purpose is to provide feedback l^at will help him or her to 
improve in the sense of becoming more meritorious , and the "slandars 
a-re the professioi^l standards of his or her own peer group, other 
educational psychologists, say. Many universities now provide for 
annual reviews, sometimes in connection with awarding salary incre- 
ments, that serve this fupction. The summative merit review occurs 
when the professor is formally considered for promotion in rank. 
Summative worth pviews are made when the professor is considered for 
tenure, and formative worth reviews can also be regularly scheduled 
(although they usually are not) to provde the professor feedback about 
how he or she is doing with respect to Unversity criteria or worth 
(e.g., role modeling, entrepreneurship , carrying a fair teaching load, 
and the like). Similar examples could be provided in virtually every 
evaluation situation. 

Summary ♦ 
In this paper we have attempted to distinguish between two kinds 
of value determinations: Those of merit and of worth. Merit, we havp 
suggested, is an intrinsic measure of value which inheres in the eval- 
uated entity with its context and thus may vary dramatically from con- 
text to context. \ye gave exampLs of both merit and worth in relation 
to four, evaluated entities: a university professor, gold, a mathematical 
proof, and a language arts curriculum. r 
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; We pointed out that while both merit and worth are somewhat viJjjL 
able worth is eapeclally variable because of its Urtlque dependence lipS* 
the context in which it is assessed' As A\^8ult of this fact, we sug- 
gested, four important consequences resulted: the determination of 
wl^rth requires^ a separate, ad hoc evaluation In each proposed context, 
that is, that worth cannot be determined once-and-for-all; that the 
values of stakeholders form an Important part o'f each local* context f 
that \kn entity may be judged worthy by one group or subgroup but 
completely unworthy by another, because of value conflicts; and that 
(and from our point of view, most important) evaluations of worth must 
be grounded in field studies of local contexts and cannot be carried out 
by the cqnventional experimental, pre-post inquiry modes that have 
dominated evaluation the past. 

We went on to suggest that merit may be determined by absolutely, 
in respect to judgments made by professional peers and experts, and 
comparatively, in respect to other entitles of the same type as that be- 
ing evaluated. Worth is determined in relation to a set of benefits that 
might be derived, as stipulated by stakehoMlng audiences. We sug- 
gested a caveat, however: that stated positions of stakeholders were 
not necessarily their true positions, and that the evaluator needed to 
keep these two sets (as well as institutional interests and ideals sep- 
arate. 

Merit and worth were found to be related in practice. In x)ne of 
three ways. They are sometimes taken as synonymous terms, and, at 
the other extreme, are often taken as cpmplately Independent. A more 
realistic and meaningful posture consists in defining worth to depend on 
some minimal level of merit but to include, in addition, certain other 
benefits defined differently In each local context. 
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Finally, w© related the merit/worth dimension to the classic forma- 
live summatlve dimension, found them to be orthogonal, and so gen- 



erated four types of evaluation. These were illustrated in the e^st of 
an innovative program* or curriculum and found to differ significantly 
along dimensions of purpose, audience, and sources of standards. It 
was asserted that similar analyses could be made of the four types in 
virtually every evaluation situations. 

The analyses and arguments we have made here have been brief 
but, we feel, compelling. The distinction between merit and worthHs 
not only conceptually intriguing but operationally heuristic. If taken 
seriously, the differences noted have enormous implications for both the 
theoryjand the methodology of evaluation, particularly, we believe, in 
pointing toward the need for more naturalistic and less experimentally- 
(Oriented evaluation. 
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Note 



This paper is based on Chapte^ 4 of a book^ln preparation by the 
present authors tentatively errtitled Naturalis tic .Evaluation, to be pub- 
lished by Jossey-Bass Publishers in 1980. ' 
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